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BACKGROUND AND OBJECTIVES: One of the criteria for deciding to invest in a listed 
company is the amount or changes in the stock price of the company in the future days 
and months. Various methods have been studied to predict the stock price and investment 
risk in a company. In most of these methods, the stock price is predicted as a continuous 
response variable. For this purpose, time series models are used in which assumptions such 
as the normality of disturbances or the linearity of the model are important. The purpose 
of this research is to introduce a two-category response variable based on the direction 
of share price movement in the next day and to introduce some statistical classification 
methods to predict it. These models do not have the limitations of the previous models, 
and for that reason, they are of interest. The main objective of this article is to implement 
he studied methods and compare their accuracy in predicting the orientation of stock price 
movement of stock exchange insurance companies. 
(METHODS: Inthe current research, we have predicted the direction of stock price movement 
by using K-nearest neighbors, decision trees, and random forest algorithms, which are 
among the non-parametric classification methods of statistical learning. The data used in 
his research includes information on the stock price of one of the insurance companies 
during the years 2019 to 2020, which has a suitable and high share in the portfolio of 
he insurance industry. To determine the accuracy of the studied models, the data were 
randomly divided into two groups, training and testing. Then, the statistical learning models 
were implemented on training data and their validity was measured using experimental 
data. 
FINDINGS: The research results indicate the high accuracy of all three non-parametric 
models in predicting the stock price category of the insurance company. Likewise, among 
the studied models, the K-nearest neighbors algorithm performed better than other 
algorithms in predicting the direction of stock price movement. 
CONCLUSION: Considering the importance of the risk of investing in an insurance company 
for customers, attainment to a valid model for stock price classification and specifying the 
variables that increase or decrease the price can help customers and insurance companies 
make better decisions. 
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Table 1: Independent variables using in classification models and their definition. 
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Table 2: Descriptive information on stock price of the Saman insurance company 
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Figure 2: The framework of the proposed model 
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Table 3: A confusion matrix sample 
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Figure 3: The accuracy of the KNN models with different k for Saman insurance company data 
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Table 4: Confusion matrix of the KNN model for testing data of the Saman insurance company 
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Figure 4: Decision tree for Saman insurance company data 
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Table 5: Confusion matrix of the decision tree for testing data of the Saman insurance company 
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Table 6: Importance measures for each independent variables in the decision tree 
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Figure 5: The accuracy of the random forests with different values of ntree and mtry for Saman insurance company data 


lols dan ES ph ctsloj] claosls gly bolas [Siz Jaro julolyy 9 Cortly Goluly Gator, Anal V Joe 
Table 7: Confusion matrix of the random forest for testing data of the Saman insurance company 
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Table 8: Importance measures for each independent variables in the random forest 
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Table 9: The accuracy of different classification models for the Saman insurance company data 
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Figure 6: The ROC curves for the KNN, decision tree and random forest and their AUCs for Saman insurance data 
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